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Abstract 

T— I ! 

^-H ■ Color quantization is an important operation with many applications in graphics and image processing. Most 
' quantization methods are essentially based on data clustering algorithms. However, despite its popularity as a 
' general purpose clustering algorithm, k-means has not received much respect in the color quantization literature 

h— ^ , because of its high computational requirements and sensitivity to initialization. In this paper, we investigate 

the performance of k-means as a color quantizer. We implement fast and exact variants of k-means with several 

' 2'' initialization schemes and then compare the resulting quantizers to some of the most popular quantizers in the 

Ph ■ . 

' literature. Experiments on a diverse set of images demonstrate that an efficient implementation of k-means with 

c/j ' an appropriate initialization strategy can in fact serve as a very effective color quantizer. 



' 1. Introduction 
> ■ 

^\ , True-color images typically contain thousands of colors, which makes their display, storage, transmission, and 

, processing problematic. For this reason, color quantization (reduction) is commonly used as a preprocessing step for 
various graphics and image processing tasks. In the past, color quantization was a necessity due to the limitations 
of the display hardware, which could not handle over 16 million possible colors in 24-bit images. Although 24- 
bit display hardware has become more common, color quantization still maintains its practical value Modern 
applications of color quantization in graphics and image processing include: (i) compression Q, (ii) segmentation 
^ ' (iii) text localization/detection (iv) color-texture analysis 0, (v) watermarking (vi) non-photorealistic 

rendering Q , and (vii) content-based retrieval Q . 

The process of color quantization is mainly comprised of two phases: palette design (the selection of a small 
set of colors that represents the original image colors) and pixel mapping (the assignment of each input pixel to 
one of the palette colors). The primary objective is to reduce the number of unique colors, A^', in an image to K 
(K <^ N') with minimal distortion. In most applications, 24-bit pixels in the original image are reduced to 8 bits 
or fewer. Since natural images often contain a large number of colors, faithful representation of these images with 
a limited size palette is a difficult problem. 



: image-independent methods that 



Color quantization methods can be broadly classified into two categories 
determine a universal (fixed) palette without regard to any specific image [ij llj , and image-dependent methods 
that determine a custom (adaptive) palette based on the color distribution of the images. Despite being very 
fast, image-independent methods usually give poor results since they do not take into account the image contents. 
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Therefore, most of the studies in the hterature consider only image-dependent methods, which strive to achieve a 
better balance between computational efficiency and visual quality of the quantization output. 

Numerous image-dependent color quantization methods have been developed in the past three decades. These 
can be categorized into two families: preclustering methods and postclustering methods ij. Preclustcring methods 
are mostly based on the statistical analysis of the color distribution of the images. Divisive preclustering methods 
start with a single cluster that contains all N image pixels. This initial cluster is recursively subdivided until K 

12 1, octree variance-based method Q, 

binary splitting 



clusters are obtained. Well-known divisive methods include median-cut 

1 5 L greedy orthogonal bipartitioning [2], optimal principal multilevel quantizer 



and rwm-cut 



preclustcring methods 



19i . More recent methods can be found in |2C 
HQ 
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Q. On 



17[, center-cut 



the other hand, agglomerative 



start with N singleton clusters each of which contains one image pixel. 
These clusters are repeatedly merged until K clusters remain. In contrast to preclustering methods that compute 
the palette only once, postclutering methods first determine an initial palette and then improve it iteratively. 
Essentially, any data clustering method can be used for this purpose. Since these methods involve iterative or 
stochastic optimization, they can obtain higher quality results when compared to preclustering methods at the 
expense of increased computational time. Clustering algorithms adapted to color quantization include k-means 

— n ~i nnnnn nn " 

I34| , minmax [35|| , competitive learning [36|, |33, 138 
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self-organizing maps 
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M |40|, fuzzy c-means [Ml^a, BIRCH 



431, and 



In this paper, we investigate the performance of the k-means (KM) clustering algorithm [47| as a color quantizer. 
We implement several efficient KM variants each one with a different initialization scheme and then compare these 
to some of the most popular color quantizers on a diverse set of images. The rest of the paper is organized as 
follows. Section [2] describes the conventional KM algorithm, a novel way to accelerate it, and several generic 
schemes to initialize it. Section [3] describes the experimental setup, demonstrates the computational advantage of 
the accelerated KM algorithm over the conventional one, and compares the accelerated KM variants with various 
initialization schemes to other color quantization methods. Finally, Section |4] gives the conclusions. 



2. Color Quantization Using K-Means Clustering Algorithm 

2.1. K-Means Clustering Algorithm 

The KM algorithm is inarguably one of the most widely used methods for data clustering [48|. Given a data 
set X = {xi,X2, . . . ,X7v} G K^, the objective of KM is to partition X into K exhaustive and mutually exclusive 
clusters S ~ {Si, 5*2, ... , Sk} , Ua^i = ^ , Si H S'j = for l<i^j<Khy minimizing the sum of squared 
error (SSE): 

K 

k=i xieSfc 

where ||.||2 denotes the Euclidean (L2) norm and is the center of cluster Sk calculated as the mean of the 
points that belong to this cluster. This problem is known to be NP-hard even for K = 2 [4^ or D = 2 Ql, but 
a heuristic method developed by Lloyd 



47| offers a simple solution. 



centers, typically chosen uniformly at random from the data points 
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Lloyd's algorithm starts with K arbitrary 
. Each point is then assigned to the nearest 



center, and each center is recalculated as the mean of all points assigned to it. These two steps are repeated until 



2 



a predefined termination criterion is met. The pseudocode for tliis procedure is given in Algo. ([T|) (bold symbols 
denote vectors). Here, m[i] denotes the membership of point x^, i.e. index of the cluster center that is nearest to 



input ; X = {xi, X2, . . . , xjv} G {N x D input data set) 

output: C = {ci, C2, . . . , ca'} e [K cluster centers) 

Select a random subset C of X as the initial set of cluster centers; 

while termination criterion is not met do 

for (i = 1; i < iV; i = i + 1) do 

Assign Xi to the nearest cluster; 

m[z] = argmin ||Xj-Cfc|l^; 
fee{i,2,...,/f} 

end 

Recalculate the cluster centers; 
for {k^l;k<K;k = k + 1) do 

Cluster Sk contains the set of points x^ that are nearest to the center c^; 

Sk = {xi \m\i] ^ k}; 

Calculate the new center as the mean of the points that belong to Sk', 

end 
end 

Algorithm 1: Conventional K-Means Algorithm 



The complexity of KM is 0{NK) per iteration for a fixed D value. For example, in color quantization applications 
D = 3 since the clustering procedure is often performed in three-dimensional color spaces such as RGB or CIEL*a*b* 



52|. 

From a clustering perspective KM has the following advantages: 
o It is conceptually simple, versatile, and easy to implement. 

o It has a time complexity that is linear in N and K. Furthermore, numerous acceleration techniques are 

'~~',QQ 56, 57, 58|. 



available in the literature [53 



wit 



o It is guaranteed to terminate [59| with a quadratic convergence rate 

The main disadvantages of KM are the facts that it often terminates at a local minimum [59j and that its 
output is sensitive to the initial choice of the cluster centers. From a color quantization perspective, KM has two 
additional drawbacks. First, despite its linear time complexity, the iterative nature of the algorithm renders the 
palette generation phase computationally expensive. Second, the pixel mapping phase is inefficient, since for each 
input pixel a full search of the palette is required to determine the nearest color. In contrast, preclustering methods 
often manipulate and store the palette in a special data structure (binary trees are commonly used), which allows 
for fast nearest neighbor search during the mapping phase. Note that these drawbacks are shared by the majority 
of postclustering methods and will be addressed in the following subsections. 

2.2. Accelerating the K-Means Algorithm 

In order to make it more suitable for color quantization, we propose the following modifications to the conven- 
tional KM algorithm: 
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1. Data sampling: A straightforward way to speed up KM is to reduce tire amount of data, wliicli can be 
acliieved by subsampling the input image data. In this study, two deterministic subsampling methods are 
utilized. The first method involves a 2:1 subsampling in the horizontal and vertical directions, so that only 
1 /4-th of the input image pixels are taken into account 1611 . This kind of moderate sampling has been found to 

.... . nnnn 

be effective in reducing the computational time without degrading the quality of quantization [611 1621 . 1631 124| . 
The second method involves sampling only the pixels with unique colors. These pixels can be determined 
efficiently using a hash table that uses chaining for collision resolution and a universal hash function of the 



form: h^{x.) = '^i^ij mod m, where x (a;i,X2,X3) denotes a pixel with red (a;i), green {X2), and blue 

(cca) components, m is a prime number, and the elements of sequence a = (01,02,03) are chosen randomly 
from the set {0, 1, . . . , m — 1}. This second subsampling method further reduces the image data since most 
images contain a large number of duplicate colors (see §3.1|) . 

2. Sample weighting: An important disadvantage of the second subsampling method described above is that it 
disregards the color distribution of the original image. In order to address this problem, each point is assigned 
a weight that is proportional to its frequency. Note that this weighting procedure essentially generates a 
one-dimensional color histogram. The weights are then normalized by the number of pixels in the image to 
avoid numerical instabilities in the calculations. In addition, Algo. ([T|) is modified to incorporate the weights 
in the clustering procedure. 

3. Sort-Means algorithm: The assignment phase of KM involves many redundant distance calculations. In 
particular, for each point, the distances to each of the K cluster centers are calculated. Consider a point x^, two 
cluster centers Ca and Ch and a distance metric d, using the triangle inequality, we have d{ca, Cf,) < d(xi, + 
(i(xi, Cb). Therefore, if we know that 2d(xi, Cq) < d{ca, Ch), we can conclude that (i(xi, Cq) < d{iii, Cf,) without 
having to calculate (i(xi,ct,). The compare-means algorithm [53j precalculates the pairwise distances between 
cluster centers at the beginning of each iteration. When searching for the nearest cluster center for each point, 
the algorithm often avoids a large number of distance calculations with the help of the triangle inequality 
test. The sort-means (SM) algorithm 53| further reduces the number of distance calculations by sorting the 



distance values associated with each cluster center in ascending order. At each iteration, point x^ is compared 
against the cluster centers in increasing order of distance from the center that x^ was assigned to in the 
previous iteration. If a center that is far enough from is reached, all of the remaining centers can be skipped 
and the procedure continues with the next point. In this way, SM avoids the overhead of going through all 
of the centers. It should be noted that more elaborate approaches to accelerate KM have been proposed in 
the literature. These include algorithms based on kd-trees |54|, 55|, coresets 



uses of the triangle inequality [58| . Some of these algorithms 



57 



561 



57|, and more sophisticated 



are not suitable for low dimensional 



data sets such as color ima ge d ata since they incur significant overhead to create and update auxiliary data 



structures [58[ . Others [5J, l55| provide computational gains comparable to SM at the expense of significant 
conceptual and implementation complexity. In contrast, SM is conceptually simple, easy to implement, and 
incurs very small overhead, which makes it an ideal candidate for color clustering. 
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We refer to the KM algorithm with the abovementioned modifications as the 'Weighted Sort-Means' (WSM) 
algorithm. The pseudocode for WSM is given in Algo. ([2|). Let 7 be the average over all points p of the number of 
centers that are no more than two times as far as p is from the center p was assigned to in the previous iteration. 
The complexity of WSM is 0{K^ + \ogK + N'^) per iteration for a fixed D value, where the terms (from left 
to right) represent the cost of calculating the pairwise distances between the cluster centers, the cost of sorting the 
centers, and the cost of comparisons, respectively. Here, the last term dominates the computational time, since in 
color quantization applications K is a, small number and furthermore K <^ N' . Therefore, it can be concluded that 
WSM is linear in N' , the number of unique colors in the original image. The influence of K on the complexity of 
WSM will be empirically demonstrated in the next section. It should be noted that, when initialized with the same 
centers, WSM gives identical results to KM. 

2.3. Initializing the K- Means Algorithm 

It is well-known in the clustering literature that KM is quite sensitive to initialization. Adverse effects of 
improper initialization include: (i) empty clusters (a.k.a. 'dead units'), (ii) slower convergence, and (iii) a higher 
chance of getting stuck in bad local minima. In this study, the following initialization schemes are investigated: 

n 

• Forgy (FGY) ^]: The cluster centers are chosen randomly from the data set. The complexity of this 
scheme is 0{K). 



• Splitting (LBG) 6J]: The first center Ci is chosen as the centroid of the data set. At iteration i {i G 
{1,2,..., log2 K}), each of the existing 2*~^ centers is split into two new centers by subtracting and adding a 
fixed perturbation vector e, i.e. cj — e and cj -t- e, (j e {1, 2, . . . , 2'^^}). These 2' new centers are then refined 
using the KM algorithm. The complexity of this scheme is 0{NK). 



Minmax (MMX) (65|, |66|, |67[: The first center Ci is chosen randomly and the i-th (i G {2, 3, . . . , K}) center 
Ci is chosen to be the point that has the largest minimum distance to the previously selected centers, i.e. 
Ci, C2, . . . , c,;_i. The complexity of this scheme is 0{NK). 



• Density-based (DEN) [68[: The data space is partitioned uniformly into M cells. From each of these cells, 
a number (that is proportional to the number of points in this cell) of centers is chosen randomly until K 
centers are obtained. The complexity of this scheme is 0{N). 



• Maximum variance ( VAR) [69[ : The data set is sorted (in ascending or descending order) on the dimension 
that has the largest variance and then partitioned into K groups along the same dimension. The centers are 
given by the data points that correspond to the medians of these K groups. The complexity of this scheme 
is 0{N\ogN). 

• Subset Farthest First (SFF) Q: One drawback of the MMX technique is that it tends to find the outliers 
in the data set. Using a smaller subset of size 2KhiK, the total number of outliers that MMX can find is 
reduced and thus the proportion of nonoutlier points obtained as centers is increased. The complexity of this 
scheme is 0{K^ In A'). 



5 



input : X — {xi, X2, . . . , xjv} e {N' x D input data set) 

W = {wi,W2, ■ ■ ■ ,wni} e [0, 1] {N' point weights) 
output: C = {ci, C2, . . . , ck} G (A' cluster centers) 
Select a random subset C of X as the initial set of cluster centers; 
while termination criterion is not met do 

Calculate the pairwise distances between the cluster centers; 
for {i = l;i < K;i = i + 1) do 

for (j = i + 1; j < K; j = j + 1) do 



Construct a KxK matrix AI in which row i is a permutation of 1,2,..., A' that 
represents the clusters in increasing order of distance of their centers from c,;; 
for {i = l;i<N';i = i + l) do 

Let Sp be the cluster that x^ was assigned to in the previous iteration; 

P = m\i]; 

min_dist = prev_dist = ||xi — Cj,||^; 

Update the nearest center if necessary; 

for {j ^2;j < K;j =j + 1) do 

t^M[p][j]; 

if d[p] [t] > 4 prev_dist then 

There can be no other closer center. Stop checking; 

break; 
end 

dist = j|xi — Ctip; 

if dist < minjlist then 

Ct is closer to x^ than Cp] 
min_dist = dist; 
m[i] ~ t; 
end 
end 
end 

Recalculate the cluster centers; 
for (fc = 1; fc < K; fc = fc + 1) do 

Calculate the new center as the weighted mean 

of points that are nearest to it; 



d[i][.j] = d[j][i\ = ||cj -Cjl 



end 



end 




end 



end 



Algorithm 2: Weighted Sort-Means Algorithm 
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• K-MeansH — |- (KPP) 7l|: The first center Ci is chosen randomly and the i-th {i E {2,3,..., K}) center 



is chosen to be x' e X with a probability of 
point X to the previously selected centers. 



r, where -D(x) denotes the minimum distance from a 



In the remainder of this paper, these will be referred to as the generic initialization schemes since they are 
applicable not only to color image data, but also to data with any dimensionality. Among these Forgy's scheme is 
the simplest and most commonly used one. However, as will be seen in the next section, this scheme often leads 
to poor clustering results. Note that there are numerous other initialization schemes described in the literature. 
These include methods based on hierarchical clustering [t^] , genetic algorithms [t^ , simulated annealing Q, [t^ , 
multiscale data condensation [3] , and kd-trees [t^ . Other interesting methods include the global k- means method 

Most of these schemes have quadratic 



[7^, Kaufman and Rousseeuw's method [79[, and the ROBIN method 
or higher complexity in the number of points and therefore are not suitable for large data sets such as color image 
data. 



3. Experimental Results and Discussion 

3.1. Image set and performance criteria 

The proposed method was tested on some of the most commonly used test images in the quantization literature 
(see Figured]). The natural images in the set include Airplane (512 x 512, 77,041 (29%) unique colors). Baboon 
(512 X 512, 153,171 (58%) unique colors). Boats (787 x 576, 140,971 (31%) unique colors), Lenna (512 x 512, 148,279 
(57%) unique colors). Parrots (1536 x 1024, 200,611 (13%) unique colors), and Peppers (512 x 512, 111,344 (42%) 
unique colors). The synthetic images include Fish (300 x 200, 28,170 (47%) unique colors) and Poolballs (510 x 383, 
13,604 (7%) unique colors). 




(a) Airplane 



(b) Baboon 



(c) Boats 



(c) Parrots 



(f) Peppers (g) Fish 

Figure 1: Test images 



(d) Lenna 




(h) Poolballs 
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The effectiveness of a quantization method was quantified by the Mean Squared Error (MSE) measure Q : 

H w 

MSE (X, X) = ^ ^ II x(/i, w) - x(/i, w) \\l (2) 

h=l w=l 

where X and X denote respectively the H x W original and quantized images in the RGB color space. MSE 
represents the average distortion with respect to the L2 norm ([T]) and is the most commonly used evaluation 
measure in the quantization literature Note that the Peak Signal-to- Noise Ratio (PSNR) measure can be easily 
calculated from the MSE value: 

(3) 



/ \ 



The efficiency of a quantization method was measured by CPU time in milliseconds, which includes the time required 
for both the palette generation and pixel mapping phases. In order to perform a fair comparison, the fast pixel 
mapping algorithm described in 8l| was used in quantization methods that lack an efficient pixel mapping phase. 
All of the programs were implemented in the C language, compiled with the gcc v4.2.4 compiler, and executed on 
an Intel Xcon E5520 2.26GHz machine. The time figures were averaged over 100 runs. 

3.2. Efficiency comparison between WSM and KM 

In this subsection, the computational efficiency of WSM is compared to that of KM. In order to ensure fairness 
in the comparisons, both algorithms were initialized with the same randomly chosen centers and terminated after 
20 iterations. Table [T] gives the Number of Distance Calculations (NDC) per pixel and computational times for 
K = {32, 64, 128, 256} on the test images. Note that for KM, NDC always equals the palette size K since the 
nearest neighbor search involves a full search of the palette for each input pixel. In contrast, WSM requires, on the 
average, 8-16 times fewer distance calculations, which is due to the intelligent use of the triangle inequality that 
avoids many calculations once the cluster centers stabilize after a few iterations. Most KM acceleration methods 
incur overhead to create and update auxiliary data structures. This means that speed up when compared to KM 
is less in CPU time than in NDC [58|. Table [1] shows that this is not the case for WSM since it exploits the color 
redundancy in the original images by reducing the amount of data before the clustering phase. It can be seen that 
WSM is actually about 12-20 times faster than KM. Note that the speed up for a particular image is inversely 
proportional to the number of unique colors in the image. Therefore, the most significant computational savings 
are observed on images with relatively few unique colors such as Parrots (13% unique colors) and Poolballs (7% 
unique colors). 

Figure [2] illustrates the scaling behavior of WSM with respect to K. It can be seen that, in contrast to KM, the 
complexity of WSM is sublinear in K. For example, on the Parrots image, increasing K from 16 to 256, results in 
only about 3.9 fold increase in the computational time (164 ms vs. 642 ms). 

3.3. Comparison of WSM against other quantization methods 

The WSM algorithm was compared to some of the well-known quantization methods in the literature: 



• Median-cut (MC) [12|: This method starts by building a 32 x 32 x 32 color histogram that contains the 
original pixel values reduced to 5 bits per channel by uniform quantization. This histogram volume is then 
recursively split into smaller boxes until K boxes are obtained. At each step, the box that contains the largest 
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Figure 2: CPU time for WSM for K = {2,3,..., 256} 

number of pixels is split along the longest axis at the median point, so that the resulting subboxes each contain 
approximately the same number of pixels. The centroids of the final K boxes are taken as the color palette. 



• Otto's method (OTT) [84I: This method is similar to MC with two exceptions: no uniform quantization is 
performed and at each step the box that gives the maximum reduction in the total squared deviation is split. 
The split axis and split point are determined by exhaustive search. 



• Octree (OCT) [13|: This two-phase method first builds an octree (a tree data structure in which each 
internal node has up to eight children) that represents the color distribution of the input image and then, 
starting from the bottom of the tree, prunes the tree by merging its nodes until K colors are obtained. In the 
experiments, the tree depth was limited to 6. 

• Variance- based method (WAN) [l^: This method is similar to MC with the exception that at each step 
the box with the largest weighted variance (squared error) is split along the major (principal) axis at the 
point that minimizes the marginal squared error. 

• Greedy orthogonal bipartitioning (WU) [l^: This method is similar to WAN with the exception that 
at each step the box with the largest weighted variance is split along the axis that minimizes the sum of the 
variances on both sides. 

• Binary splitting method (BS) This method is similar to WAN with two exceptions: no uniform 
quantization is performed and at each step the box with the largest eigenvalue is split along the major axis 
at the mean point. 
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Neu-quant (NEU) [44|: This method utihzes a one-dimensional self-organizing map (Kohonen neural net- 
work) with 256 neurons. A random subset of N / f pixels is used in the training phase and the final weights of 
the neurons are taken as the color palette. In the experiments, the highest quality configuration, i.e. / = 1, 
was used. 



Modified minmax (MMM) [35|: This method chooses the first center Ci arbitrarily from the data set 
and the z-th center C; (i = 2, 3, . . . , K) is chosen to be the point that has the largest minimum weighted 
distance (the weights for the red, green, and blue channels are taken as 0.5, 1.0, and 0.25, respectively) to the 
previously selected centers, i.e. Ci, C2, . . . , Ci_i. Each of these initial centers is then recalculated as the mean 
of the points assigned to it. 



Split Merge (SAM) [29|: This two-phase method first partitions the color space uniformly into B 
partitions. This initial set of B clusters is represented as an adjacency graph. In the second phase, {B — K) 
merge operations are performed to obtain the final K clusters. At each step of the second phase, the pair 
of clusters with the minimum joint quantization error are merged. In the experiments, the initial number of 
clusters was set to _B = 2QK. 



Fuzzy c-Means with partition index maximization (PIM) [4l|: Fuzzy c-Means (FCM) [83 is a gener- 
alization of KM in which points can belong to more than one cluster. The algorithm involves the minimization 
of the functional Jg(C/, V) = X)t=i S^Li ^ifc IN« ~ ^k\\\ with respect to U (a fuzzy A'-partition of the data set) 
and V (a set of prototypes - cluster centers). The parameter q controls the fuzziness of the resulting clusters. 
At each iteration, the membership matrix U is updated by Uik = (^J2f=i (ll^i ^ '^A;ll2/ll^i ^ ^j\\2)'^^^'' ' 
which is followed by the update of the prototype matrix V by Vfc = (^J2iLi ''^ik^ij j (SiLi ^ naive 

implementation of the FCM algorithm has a complexity of 0{NK^) per iteration, which is quadratic in 



the number of clusters. In the experiments, a linear complexity formulation, i.e. 0{NK), described in 84 1 
was used and the fuzziness parameter was set to q = 2 as commonly seen in the fuzzy clustering liter- 
ature 48|. PIM is an extension of FCM in which the functional to be minimized incorporates a cluster 



validity measure called the 'partition index' (PI). This index measures how well a point has been clas- 
sified and is defined as Pi = X^aLi "?fe- "^^^"^ FCM functional can be modified to incorporate PI as follows: 
Jq{U, V) ~ X^iLi SfcLi W'^i ^ '^k\\\ ^ QiX^iLi Pi- The parameter a controls the weight of the second term. 
The procedure that minimizes Jg{U, V) is identical to the one used in FCM except for the membership ma- 
trix update equation: Uik = (X^jLi [dl^i ^ '^k\\2 ^ <^)/(ll^'i ^ "^jW-? ^ a)]^^''' ■ An adaptive method to 



^3= 

determine the value of a is to set it to a fraction < S < 0.5 of the distance between the nearest two centers, 

the fraction value was set to S = 0.4. 



41 



i.e. a = (5min ||vi — Vj||2. Following 
Competitive learning clustering (ADU) 



ft 



Adaptive Distributing Units (ADU) algorithm 



This method is an adaptation of Uchiyama and Arbib's 



36| to color quantization. ADU is a competitive learning 
algorithm in which units compete to represent the input point presented in each iteration. The winner is 
then rewarded by moving it closer to the input point at a rate of 7 (the learning rate). The procedure starts 
with a single unit whose center is given by the centroid of the input points. New units are added by splitting 
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existing units that reach the threshold number of wins 8 until the number of units reaches K. Following 39[, 
the algorithm parameters were set to 9 = 400^7^, tmax = (2A' — 3)6', and 7 = 0.015. 

Fourteen variants of WSM each with a different initialization scheme were implemented. These include variants 
that utihze the generic initiahzation schemes discussed in ^Z3l namely WSM-FGY, WSM-LBG, WSM-MMX, WSM- 
DEN, WSM-VAR, WSM-SFF, and WSM-KPP, as well as variants that use the abovementioned preclustering 
methods as initializers, i.e. WSM-MC, WSM-OTT, WSM-OCT, WSM- WAN, WSM-WU, WSM-BS, and WSM- 
SAM. Each variant was executed until it converged. Convergence was determined by the following commonly used 
criterion [6J]: (SSEj„i — SSE.i)/SSEi < e, where SSE^ denotes the SSE ^ value at the end of the i-th iteration. 
The convergence threshold was set to e ~ 0.001. 

Tables [2][S] compare the effectiveness of the methods for 32, 64, 128, and 256 colors, respectively on the test 
images. The best (lowest) error values are shown in bold. Similarly, Tables |6]|9] give the efficiency comparison of 
the methods. In addition, for each K value, the methods are first ranked based on their MSE values for each image. 
These ranks are then averaged over all test images. The same is done for the CPU time values. Table [TOl gives the 
mean MSE and CPU time ranks of the methods. The last column gives the overall mean ranks with the assumption 
that both criteria have equal importance. Note that the best possible rank is 1. The following observations are in 
order: 

> In general, the postclustering methods are more effective but less efficient than the preclustering methods. 

> The most effective preclustering methods are BS, OTT, and WU. The least effective ones are MC, WAN, and 
MMM. 

^ The most effective postclustering methods are WSM-WU, WSM-BS, WSM- WAN, WSM-SAM, and WSM- 
LBG. Note that two of these methods, namely WSM- WAN and WSM-SAM, utihze initialization methods that 
are quite ineffective by themselves. (The MSE ranks of WAN and SAM are 24.00 and 20.19, respectively.) 
The least effective postclustering methods are PIM, NEU, and WSM-MMX. 

> Preclustering methods are generally more effective and efficient (especially when K is small) initializers when 
compared to the generic schemes. This was expected since the former methods are designed to exploit the 
peculiarities of color image data such as limited range and sparsity. Therefore, they are particularly suited 
for time-constrained applications such as color based retrieval from large image databases, where images are 
often reduced to a few colors prior to the similarity calculations ^ . 

> In general, WSM-WU is the best method. This method is not only the overall most effective method, but 
also the most efficient postclustering method. In each case, it obtains one of the lowest MSE values within a 
fraction of a second. 

> In general, the fastest method is MC, which is followed by WU, WAN, and SAM. The slowest methods are 
PIM, WSM-LBG, MMM, ADU, NEU, and BS. 

Table fTTl gives the number of iterations that each WSM variant requires until reaching convergence. As before, 
for each K value, the methods are first ranked based on their iteration counts for each image. These ranks arc then 
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averaged over all test images. Table [T^] gives the results, with the last column representing the overall mean ranks. 
The correlation coefficient between this column and the overall mean MSE ranks (column 6 of Tabic [TO)) is 0.882, 
which indicates that WSM often takes longer to converge when initialized by an ineffective preclustering method. 
Interestingly, despite the fact that it converges the fastest, WSM-LBG is one of the slowest quantization methods 
because of its costly initialization phase, i.e. the LBG algorithm. In fact, the correlation coefficient between the 
overall mean iteration count ranks and the overall mean CPU time ranks (column 11 of Tabic [T0|) is 0.034, which 
indicates a lack of correlation between the number of iterations and computational speed. 

It should be noted that initializing a postclustcring method such as KM (or WSM) using the output (color 
palette) generated by a preclustering method is not a new idea. Numerous early studies ' ~ ~ ' ^ 
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investigated this particular two-phase quantization scheme and concluded that slight improvements (reductions) in 
the MSE due to the use of KM is largely offset by the dramatic increase in the computational time. Table [13] gives 
the percent MSE improvements obtained by refining the outputs generated by the preclustering methods using 
WSM. For example, when K — 32, WSM-MC obtains, on the average, 42% lower MSE when compared to MC on 
the test images. It can be seen that WSM improves the MSE values by an average of 18-50%. When combined 
with its significant computational efficiency (see ^3.2p , these improvements show that the conclusions made for KM 
in the abovcmcntioncd studies are not valid for WSM. The correlation coefficient between the mean percent MSE 
improvement values (last column of Table fT3|) and the overall mean MSE ranks (column 6 of Tabic [TO|l is 0.988, 
which indicates that WSM is much more likely to obtain a significant MSE improvement when initialized by an 
ineffective preclustering method such as MC or WAN. This is not surprising given that such ineffective methods 
generate outputs that are likely to be far from a local minimum and hence WSM can significantly improve upon 
their results. Nevertheless, it can be said that WSM benefits even highly effective preclustering methods such as 
BS, OTT, and WU. 

Figure [3] shows sample quantization results and the corresponding error images for a close-up part of the Baboon 
image. The error image for a particular quantization method was obtained by taking the pixclwise absolute difference 
between the original and quantized images. In order to obtain a better visualization, pixel values of the error images 
were multiplied by 4 and then negated. It can be seen that PIM, NEU, BS, and WU are unable to represent the 
color distribution of the sclera (yellow part of the eye). This is because, this region is a relatively small part of the 
face and therefore, despite its visual significance, it is assigned representative colors that are derived from larger 
regions with different color distributions, e.g. the red nose. In contrast, WSM variants, i.e. WSM-BS and WSM-WU, 
perform significantly better in allocating representative colors to the sclera, resulting in cleaner error images. 

Figure [4] shows sample quantization results and the corresponding error images for a close-up part of the Peppers 
image. It can be seen that PIM, NEU, MC, and WAN are particularly ineffective around the edges. On the other 
hand, WSM variants, i.e. WSM-MC and WSM- WAN, are significantly better in representing this edgy region. Once 
again, the refinement due to WSM is remarkable for the preclustering methods WAN and, in particular, MC. 

We should also mention a recent study by Chen et al. that involves color quantization and the KM algorithm 



871. In their method, the input image is first quantized uniformly in the Hue-Saturation- Value (HSV) color space 
to obtain a color histogram with 30 x 7 x 7 bins and a grayscale one with 8 bins. Initial cluster centers are then 



determined from each histogram using a modified MMX procedure that selects a maximum of 10 centers using an 
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empirically determined distance threshold. Finally, the histograms are jointly clustered using the KM algorithm 
and the resulting image is post-processed to eliminate small regions. To summarize, this method aims to partition 
the input image into a number of homogeneously colored regions using an image-independent quantization scheme 
and histogram clustering. In contrast, the proposed methods aim to reduce the number of colors in the input image 
to a predefined number using an image-dependent scheme. However, both approaches involve KM clustering on 
histogram data. 



Table 1: NDC and CPU time comparison between WSM and KM 



K 


Criterion 


Method 


AIR 


BBN 


BTS 


LEN 


PAR 


PEP 


FSH 


PLB 


Mean 






KM 


32 


32 


32 


32 


32 


32 


32 


32 


32 




NDC 


WSM 


3.20 


4.02 


4.00 


3.00 


4.09 


4.04 


4.37 


5.14 


3.98 


32 




KMiWSM 


10.00 


7.95 


8.00 


10.67 


7.83 


7.91 


7.33 


6.23 


8.24 




KM 


802 


899 


1415 


858 


4718 


828 


192 


551 


1283 




Time 


WSM 


58 


153 


134 


118 


207 


109 


31 


20 


104 






KMiWSM 


13.64 


5.87 


10.52 


7.24 


22.76 


7.59 


6.08 


26.90 


12.58 






KM 


64 


64 


64 


64 


64 


64 


64 


64 


64 




NDC 


WSM 


4.62 


5.73 


5.69 


4.45 


5.89 


6.18 


6.25 


6.66 


5.68 


64 




KMiWSM 


13.86 


11.17 


11.24 


14.39 


10.86 


10.36 


10.24 


9.61 


11.47 




KM 


1540 


1671 


2708 


1630 


9392 


1600 


367 


1069 


2497 




Time 


WSM 


81 


191 


172 


174 


259 


150 


41 


27 


137 






KMiWSM 


18.92 


8.74 


15.66 


9.34 


36.20 


10.66 


8.84 


38.76 


18.39 






KM 


128 


128 


128 


128 


128 


128 


128 


128 


128 




NDC 


WSM 


7.70 


9.11 


9.33 


7.47 


9.30 


9.90 


10.58 


11.17 


9.32 


128 




KMiWSM 


16.62 


14.05 


13.71 


17.13 


13.76 


12.93 


12.10 


11.45 


13.97 




KM 


2992 


3039 


5153 


3096 


17919 


3098 


695 


2128 


4765 




Time 


WSM 


131 


280 


257 


274 


431 


221 


76 


61 


216 






KMiWSM 


22.76 


10.85 


20.00 


11.30 


41.49 


13.96 


9.10 


34.38 


20.48 






KM 


256 


256 


256 


256 


256 


256 


256 


256 


256 




NDC 


WSM 


13.80 


15.53 


15.62 


13.41 


15.64 


16.41 


18.18 


20.51 


16.14 


256 




KMiWSM 


18.56 


16.48 


16.39 


19.08 


16.37 


15.60 


14.08 


12.48 


16.13 




KM 


5849 


5820 


10048 


5793 


34786 


5853 


1347 


4160 


9207 




Time 


WSM 


304 


662 


463 


434 


688 


429 


205 


246 


429 






KMiWSM 


19.21 


8.78 


21.66 


13.34 


50.53 


13.62 


6.57 


16.89 


18.83 



Table 2; MSE comparison of the quantization methods (K = 32) 



Method 


AIR 


BBN 


BTS 


LEN 


PAR 


PEP 


FSH 


PLB 


MC 


123.9 


546 





200 


2 


205 


7 


401 


2 


332 


7 


275 


9 


136.3 


OTT 


92.4 


365 


8 


156 


3 


141 


4 


252 


2 


246 


5 


157 


6 


67.7 


OCT 


101.6 


460 


3 


174 


5 


186 


2 


343 


5 


306 


8 


218 


2 


130.4 


WAN 


116.9 


509 


1 


198 


5 


216 


2 


364 


7 


333 


3 


310 


5 


111.7 


WU 


75.3 


421 


8 


154 


5 


157 


8 


291 


1 


264 


2 


186 


6 


68.3 


BS 


73.6 


388 


8 


136 


5 


138 


3 


298 


9 


261 


7 


161 


4 


89.0 


NEU 


101.5 


363 


1 


147 


3 


135 


1 


306 





248 


7 


172 


8 


103.5 


MMM 


134.3 


488 


9 


203 


1 


184 


9 


331 


6 


291 


8 


234 


8 


165.6 


SAM 


119.8 


395 


9 


161 


4 


158 


3 


275 


8 


267 


9 


198 


3 


90.5 


PIM 


73.9 


412 





161 


4 


159 





295 


5 


265 


5 


170 





134.8 


ADU 


84.5 


331 


9 


120 


4 


120 


6 


238 


8 


222 


5 


149 


6 


131.9 


WSM-FGY 


58.6 


331 


3 


117 


9 


121 





235 


8 


221 


7 


147 


9 


64.1 


WSM-LBG 


59.1 


327 


3 


117 


9 


119 


6 


229 


3 


220 


7 


142 


2 


69.3 


WSM-MMX 


68.5 


331 


1 


117 


9 


126 


6 


236 


7 


225 


5 


152 


6 


103.1 


WSM-DEN 


58.7 


332 


6 


117 


8 


120 


9 


237 


1 


221 


9 


147 


6 


61.2 


WSM-VAR 


59.1 


334 


9 


116 


9 


119 


1 


229 


5 


220.1 


147 


1 


68.5 


WSM-SFF 


59.7 


329 


6 


117 


7 


119 


9 


235 


1 


220.8 


145 


4 


89.7 


WSM-KPP 


58.9 


330 


6 


116 


5 


120 





233 


1 


221.0 


142 


3 


74.1 


WSM-MC 


65.0 


335 


9 


117 


7 


119 


8 


231 


4 


221.8 


148 


1 


71.0 


WSM-OTT 


66.9 


337 


1 


121 


4 


120 


9 


227 


2 


224.4 


141 


7 


53.6 


WSM-OCT 


58.2 


327 


1 


117 


4 


120 


9 


239 


2 


226.7 


141 


.1 


73.4 


WSM-WAN 


58.2 


326 


6 


116 


4 


118 


5 


233 


7 


227.3 


143 


8 


50.5 


WSM-WU 


56.0 


329 


6 


115 





118 


2 


222 


5 


220.4 


141.3 


50.3 


WSM-BS 


58.9 


332 


3 


116 


9 


118 


4 


228 





221.7 


141.9 


54.4 


WSM-SAM 


58.5 


327 





116 


4 


118 


8 


227 


9 


220.7 


143.1 


67.2 



4. Conclusions 

In this paper, the k-means clustering algorithm was investigated from a color quantization perspective. This 
algorithm has been criticized in the quantization literature because of its high computational requirements and sen- 
sitivity to initialization. We first introduced a fast and exact k-means variant that utilizes data reduction, sample 
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Tabic 3; MSE comparison of the quantization methods (K = 64) 



Method 


AIR 


BBN 


BTS 


LEN 


PAR 


PEP 


FSH 


PLB 


MC 


81 2 


371 


126 4 


139 2 


258 


212 8 


169 5 


63 7 


\J 1 1 


OD.y 


222 9 


c5D.^S 


HQ Q 

oy .y 


144 6 


1 "^9 
lOZ.O 


Q7 7 

y / . / 


9Q n 




54 4 


zoy .0 


QQ 7 

yy. / 


1 riQ 7 


1 RS Q 


1 7Q S 


124 7 


/I fi n 


WAN 


Dy .0 


OZD.41: 


1 ID.D 


140 4 


225 3 


215 1 


9nR 


f^Q /I 


vv u 


47.0 


2,4:7 .Q 


86.9 


98.9 


170.8 


160.2 


111,4 


01.4 


-Do 


42 1 


ZOO.41: 


77 1 
i i .1 


O.S.0 


162 2 


1 Rn n 

IDU.U 


1 nn 1^ 


00.0 


NEU 


/I fi Q 


216 2 


7Q Q 

( y.i 




lOo.U 


151 1 


107 2 


AIR 
■±0.0 


iViiViiVi 


HI ^ 


zoy.o 


1 1 Q 

1 lo.y 


1 lO.O 


ZUU.41: 


1 R1 R 


1 Tfi 
loD. D 


Q1 T 

y 1 .0 


SAM 




245 1 


yo.o 




1 fin 


1 Rn R 

IDU.D 


120 1 


^A 1 

041:. 


PIM 


44 3 


261 2 


1 nn fi 

lUU.D 


QQ 1 

yy . 1 


1 7"? Q 


1 7R n 


111 3 


f^R ^ 
OD.O 


ADU 


41:0. y 


ly f .u 


Du.U 


79 Q 

f ^.y 


1 "iO ^ 
loz.o 


loo. 41: 


Qn n 
yu.u 


fi/1 n 

D41:.U 


vv OiVi-Jr Kjr 1 






DO.U 


7'^ 7 


129 4 


134 2 


RR R 


9Q R 

zy .(5 


VV OiVI-ijJjVj 


T /I n 
o'±.\j 


1 QR Q 

lyo.y 


R/1 n 

041:.U 


71 9 


1 97 R 


131 4 


R/1 R 


9R Q 


VV OiVi-iViiViA. 


38.8 


198.6 


66.4 


74.9 


131.3 


134.8 


94.0 


59.4 


vv OiVi-UJzjiN 




1 QS ^ 

lyo.o 


DO.Z 


7'^ R 


1 9Q 7 


1 '^^ 7 
loo. / 


RQ 7 

oy. / 


28 4 


WSM-VAR 


33.8 


199.2 


64.7 


72.8 


138.6 


133.6 


87.3 


24.0 


WSM-SFF 


36.8 


198.2 


66.3 


72.9 


129.9 


133.7 


89.9 


46.2 


WSM-KPP 


35.0 


197.7 


64.7 


73.0 


128.7 


133.5 


86.1 


29.6 


WSM-MC 


38.7 


200.0 


64.9 


75.4 


127.0 


134.7 


90.1 


31.1 


WSM-OTT 


41.9 


197.3 


67.6 


72.4 


127.1 


131.3 


86.3 


23.7 


WSM-OCT 


36.4 


196.3 


65.0 


73.4 


127.9 


132.6 


86.1 


30.2 


WSM-WAN 


34.2 


197.7 


63.4 


71.9 


126.0 


134.3 


85.0 


22.0 


WSM-WU 


34.3 


196.5 


63.5 


72.0 


125.2 


131.4 


84.7 


21.8 


WSM-BS 


34.6 


196.3 


63.9 


71.9 


126.3 


131.9 


84.4 


22.5 


WSM-SAM 


35.9 


195.4 


64.2 


72.9 


125.0 


131.5 


86.0 


28.3 



weighting, and accelerated nearest neighbor search. This fast k-means algorithm was then used in the implemen- 
tation of several quantization methods each one featuring a different initialization scheme. Extensive experiments 
on a large set of classic test images demonstrated that the proposed k-means implementations outperform state-of- 
the-art quantization methods with respect to distortion minimization. Other advantages of the presented methods 
include ease of implementation, high computational speed, and the possibility of incorporating spatial information 
into the quantization procedure. 

The implementation of the k-means based quantization methods will be made publicly available as part of the 
Fourier image processing and analysis library, which can be downloaded fromj http : //sourcef orge .net/pro j ects/| 
|f ourier-ipal[ 
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Table 5: MSE comparison of the quantization methods {K = 256) 
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Table 6: CPU time comparison of the quantization methods {K = 32) 
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Table 7; CPU time comparison of the quantization methods {K = 64) 
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Table 8: CPU time comparison of the quantization methods {K = 128) 
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Table 9: CPU time comparison of the quantization methods {K = 256) 
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Table 10: MSE and CPU time rank comparison of the quantization methods 
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17.48 


MMM 


23.38 


23.50 


23.00 


22.13 


23.00 


20.88 


22.75 


22.00 


20.50 


21.53 


22.27 


SAM 


19.88 


20.00 


20.13 


20.75 


20.19 


2.50 


2.63 


4.13 


4.13 


3.34 


11.77 


PIM 


19.63 


20.13 


21.00 


21.88 


20.66 


25.00 


25.00 


25.00 


25.00 


25.00 


22.83 


ADU 


14.13 


12.50 


10.00 


9.25 


11.47 


9.75 


19.88 


23.50 


23.88 


19.25 


15.36 


WSM-FGY 


9.50 


10.25 


9.75 


8.63 


9.53 


14.25 


13.63 


13.63 


15.13 


14.16 


11.84 


WSM-LBG 


7.00 


4.13 


6.63 


5.88 


5.91 


22.13 


22.00 


22.00 


22.50 


22.16 


14.03 


WSM-MMX 


13.50 


14.88 


15.50 


14.88 


14.69 


16.38 


16.38 


18.00 


20.38 


17.78 


16.24 


WSM-DEN 


10.00 


10.00 


9.50 


9.13 


9.66 


15.13 


13.25 


14.63 


15.25 


14.56 


12.11 


WSM-VAR 


7.38 


8.25 


7.13 


7.63 


7.60 


13.38 


13.00 


12.63 


14.00 


13.25 


10.42 


WSM-SFF 


9.13 


11.75 


12.50 


13.25 


11.66 


10.38 


11.50 


13.13 


16.50 


12.88 


12.27 


WSM-KPP 


7.63 


8.25 


8.13 


7.75 


7.94 


16.50 


16.63 


17.75 


17.38 


17.06 


12.50 


WSM-MC 


10.25 


12.00 


14.13 


14.75 


12.78 


12.38 


14.88 


16.25 


15.88 


14.84 


13.81 


WSM-OTT 


9.25 


7.38 


8.63 


8.88 


8.53 


12.25 


11.25 


10.75 


13.00 


11.81 


10.17 


WSM-OCT 


8.38 


8.13 


7.25 


6.00 
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16.75 
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12.75 


11.50 
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10.55 


WSM-WAN 


5.50 


4.50 


4.13 


6.25 


5.09 


8.50 


8.75 


8.75 


12.63 


9.66 


7.38 


WSM-WU 


1.88 


2.88 


2.75 


2.00 


2.38 


9.38 


8.50 


8.38 


8.13 


8.59 


5.48 


WSM-BS 


5.88 


3.38 


2.38 


1.88 
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19.75 


18.13 
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13.50 


17.00 


10.19 


WSM-SAM 


4.00 
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6.38 


7.88 


5.81 


7.50 


6.63 


8.63 
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8.75 


7.28 
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Table 11; Iteration count comparison of the WSM variants 





AIR 


BBN 


BTS 


LEN 


PAR 


PEP 


FSH 


PLB 


if = 32 




WSM-FGY 


32 


23 


30 


29 


24 


27 


28 


19 


WSM-LBG 


6 


4 


18 


3 


16 


8 


5 


15 


WSM-MMX 


34 


21 


29 


22 


26 


24 


31 


20 


WSM-DEN 


31 


24 


34 


29 


23 


27 


23 


24 


W oM- VAK. 


18 


24 


17 


21 


25 


22 


26 


22 


WSM-SFF 


35 


19 


25 


19 


23 


22 


25 


16 




23 


20 


26 


21 


22 


21 


20 


15 


WO TV/I" A/rr^ 


29 


13 


29 


19 


17 


25 


30 


31 


VV 01V±— -L J- 


71 


18 


30 


21 


12 


14 


11 


18 


WSM OPT 


26 


24 


31 


29 


23 


15 


13 


22 


WSA/T WAN 


18 


21 


18 


12 


19 


12 


18 


12 


wcA/r WTT 

VV OiVl- VV u 


9 


13 


23 


18 


16 


9 


13 


16 


VV OiVl-JZ>0 


14 


13 


10 


11 


8 


13 


10 


10 


WSM SAM 
VV oiv±— o.r\iv± 


22 


8 


19 


14 


7 


18 


12 


14 


K = 64 




WSM-FGY 


28 


25 


29 


28 


25 


30 


29 


20 


WSM-LBG 


6 


3 


11 


6 


19 


15 


6 


10 


WSM-MMX 


44 


25 


40 


26 


35 


29 


24 


15 


WSM-DEN 


29 


24 


30 


27 


25 


29 


25 


21 


W oM- VAii. 


32 


26 


23 


22 


20 


27 


25 


21 


WSM-SFF 


37 


21 


32 


22 


27 


24 


22 


16 


WSM KPP 

VV OiVJ.— IVJT JT 


24 


20 


27 


20 


23 


23 


22 


14 


WSA/T MC 

VV OiVl-iViv^ 


43 


20 


43 


22 


35 


44 


30 


19 


VV OiVl-V_y ± J. 


63 


17 


32 


20 


15 


25 


12 


12 


WSAA or'T 


30 


20 


18 


17 


16 


24 


19 


27 


WSA/T WA N" 

VV OiVl- VV-TliN 


15 


18 


17 


19 


17 


16 


19 


15 


WSA/T WTT 

VV OiVl- VV u 


13 


12 


21 


17 


13 


16 


20 


9 


WSM RS 

V V O iV±— i_j o 


12 


16 


14 


13 


14 


17 


13 


12 


VV OiVl-OrtiVl 


13 


9 


20 


15 


12 


11 


12 


18 


K = 128 




WSM-FGY 


26 


23 


28 


26 


26 


28 


23 


19 


WSM-LBG 


11 


8 


14 


7 


25 


15 


5 


4 


WSM-MMX 


48 


27 


48 


27 


39 


33 


26 


13 


WSM-DEN 


26 


24 


27 


26 


25 


27 


24 


19 


ISfQ A/T 17" A "D 


22 


19 


25 


24 


21 


27 


20 


16 


WSM-SFF 


30 


23 


40 


22 


30 


27 


22 


16 


WSM KPP 

VV OiV±— IVJT JT 


22 


20 


26 


20 


25 


23 


18 


13 


WSM MC 

VV OiV±— iVlV^ 


54 


23 


48 


29 


35 


33 


22 


16 


WSM OTT 


18 


18 


51 


24 


17 


20 


22 


10 


WSAA or'T 


21 


19 


32 


20 


19 


22 


19 


16 


WSA/T WAN" 

VV OiVl- VV-Ti-lN 


17 


16 


27 


16 


22 


23 


16 


14 


WSM WTT 

VV OiVl- VV <J 


16 


17 


12 


17 


22 


21 


16 


9 


WSM RS 

V V O iVl— i_J) o 


12 


15 


16 


15 


19 


20 


11 


9 


WSA/T SAA/T 

VV OiVl-OrtiVl 


21 


11 


34 


17 


14 


16 


16 


13 


K = 256 




WSM-FGY 


23 


25 


26 


25 


26 


26 


19 


15 


WSM-LBG 


9 


3 


10 


6 


18 


8 


6 


8 


WSM-MMX 


36 


33 


49 


28 


43 


32 


22 


14 


WSM-DEN 


23 


24 


26 


24 


26 


25 


19 


17 


WSM- VAK 


22 


20 


21 


27 


23 


22 


18 


17 


WSM-SFF 


27 


26 


35 


23 


35 


28 


21 


13 


WSM-KPP 


19 


20 


23 


20 


25 


22 


17 


13 


WSM-MC 


29 


23 


47 


28 


40 


32 


17 


10 


WSM-OTT 


21 


22 


19 


20 


28 


23 


18 


14 


WSM-OCT 


18 


21 


21 


20 


22 


18 


18 


14 


WSM-WAN 


20 


22 


21 


25 


22 


19 


20 


11 


WSM-WU 


16 


16 


15 


15 


16 


16 


16 


10 


WSM-BS 


14 


16 


16 


18 


16 


18 


16 


15 


WSM-SAM 


20 


15 


40 


21 


14 


19 


22 


13 



Table 12: Iteration count rank comparison of the WSM variants 



Method 


32 


64 


128 


256 


Mean 


WSM-FGY 


11.75 


11.13 


11.38 


10.75 


11.25 


WSM-LBG 


2.00 


2.00 


2.00 


1.38 


1.84 


WSM-MMX 


11.38 


11.50 


12.63 


13.25 


12.19 


WSM-DEN 


12.13 


11.13 


11.00 


10.00 


11.06 


WSM-VAR 


8.88 


10.25 


7.63 


8.25 


8.75 


WSM-SFF 


8.75 


9.88 


10.38 


10.88 


9.97 


WSM-KPP 


7.75 


7.50 


7.25 


5.88 


7.09 


WSM-MC 


9.00 


11.75 


12.13 


10.38 


10.81 


WSM-OTT 


7.25 


7.00 


6.50 


7.38 


7.03 


WSM-OCT 


9.50 


6.88 


6.75 


5.38 


7.13 


WSM-WAN 


4.88 


5.00 


5.25 


6.88 


5.50 


WSM-WU 


4.25 


3.63 


3.63 


2.25 


3.44 


WSM-BS 


2.25 


3.13 


2.50 


3.63 


2.88 


WSM-SAM 


4.00 


3.25 


4.13 


6.63 


4.50 
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Tabic 13: MSE improvements for the preclustering methods 



Method 


32 


64 


128 


256 


Moan 


MC 


42% 


47% 


49% 


52% 


47% 


OTT 


15% 


17% 


19% 


21% 


18% 


OCT 


34% 


32% 


31% 


27% 


31% 


WAN 


44% 


49% 


52% 


55% 


50% 


wu 


24% 


25% 


26% 


27% 


25% 


BS 


19% 


19% 


19% 


18% 


19% 


SAM 


26% 


30% 


32% 


35% 


31% 
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